Search CORE

23 research outputs found

FIDEL-a retrovirus-like retrotransposon and its distinct evolutionary histories in the A- and B-genome components of cultivated peanut.

Author: ARRIAL R.
BERTIOLI D.
BERTIOLI S. C. de M. L.
CAMPOS-FONSECA F.
GUIMARAES P. M.
NIELEN S.
SEIJO G.
TOWN C.
Publication venue
Publication date: 20/09/2018
Field of study

Repository Open Access to Scientific Information from Embrapa

Screening non-coding RNAs in transcriptomes from neglected species using PORTRAIT: case study of the pathogenic fungus Paracoccidioides brasiliensis

Author: C Xue
CH Wu
F Jossinet
G Cochrane
IH Witten
J Kyte
J Liu
JM Otaki
JS Mattick
JW Fickett
K Numata
K Shimizu
KC Pang
L Kong
LA Rymarquis
Marcelo de M Brigido
MC Frith
MS Felipe
N Harte
P Rice
R Teramoto
RJ Carter
Roberto C Togawa
Roberto T Arrial
S Griffiths-Jones
S He
S McGinnis
T Ravasi
VJ Promponas
W Li
WS Noble
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background Transcriptome sequences provide a complement to structural genomic information and provide snapshots of an organism's transcriptional profile. Such sequences also represent an alternative method for characterizing neglected species that are not expected to undergo whole-genome sequencing. One difficulty for transcriptome sequencing of these organisms is the low quality of reads and incomplete coverage of transcripts, both of which compromise further bioinformatics analyses. Another complicating factor is the lack of known protein homologs, which frustrates searches against established protein databases. This lack of homologs may be caused by divergence from well-characterized and over-represented model organisms. Another explanation is that non-coding RNAs (ncRNAs) may be caught during sequencing. NcRNAs are RNA sequences that, unlike messenger RNAs, do not code for protein products and instead perform unique functions by folding into higher order structural conformations. There is ncRNA screening software available that is specific for transcriptome sequences, but their analyses are optimized for those transcriptomes that are well represented in protein databases, and also assume that input ESTs are full-length and high quality. Results We propose an algorithm called PORTRAIT, which is suitable for ncRNA analysis of transcriptomes from poorly characterized species. Sequences are translated by software that is resistant to sequencing errors, and the predicted putative proteins, along with their source transcripts, are evaluated for coding potential by a support vector machine (SVM). Either of two SVM models may be employed: if a putative protein is found, a protein-dependent SVM model is used; if it is not found, a protein-independent SVM model is used instead. Only <it>ab initio </it>features are extracted, so that no homology information is needed. We illustrate the use of PORTRAIT by predicting ncRNAs from the transcriptome of the pathogenic fungus <it>Paracoccidoides brasiliensis </it>and five other related fungi. Conclusion PORTRAIT can be integrated into pipelines, and provides a low computational cost solution for ncRNA detection in transcriptome sequencing projects.</p

Repository Open Access to Scientific Information from Embrapa

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

The Tetraodon nigroviridis reference transcriptome: Developmental transition, length retention and microsynteny of long non-coding RNAs in a compact vertebrate genome

Author: A Kapusta
A Necsulea
A Pauli
A Stabenau
AJ Vilella
AR Quinlan
B Maher
C Nepal
C Trapnell
C Weaver
CA Watson
CM Smith
D Kim
DR Kelley
F Pelegri
G St. Laurent
GT Williams
H Aanes
H Hezroni
H Roest Crollius
H Roest Crollius
H Tilgner
I Ulitsky
J Harrow
J Kim
J Ponjavic
J Ruiz-Orera
J-W Nam
JB Brown
M Blanchette
M Chorev
M Lohse
MD Robinson
MN Cabili
NT Ingolia
O Jaillon
P Flicek
P Heyn
P Miura
R Arrial
RC Gentleman
S Aparicio
S Basu
S Brenner
S Durinck
S Mathavan
SA Harvey
SS Paranjpe
T Derrien
T Kino
TR Dreszer
V Haberle
W Tadros
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Pufferfish such as fugu and tetraodon carry the smallest genomes among all vertebrates and are ideal for studying genome evolution. However, comparative genomics using these species is hindered by the poor annotation of their genomes. We performed RNA sequencing during key stages of maternal to zygotic transition of Tetraodon nigroviridis and report its first developmental transcriptome. We assembled 61,033 transcripts (23,837 loci) representing 80% of the annotated gene models and 3816 novel coding transcripts from 2667 loci. We demonstrate the similarities of gene expression profiles between pufferfish and zebrafish during maternal to zygotic transition and annotated 1120 long non-coding RNAs (lncRNAs) many of which differentially expressed during development. The promoters for 60% of the assembled transcripts result validated by CAGE-seq. Despite the extreme compaction of the tetraodon genome and the dramatic loss of transposons, the length of lncRNA exons remain comparable to that of other vertebrates and a small set of lncRNAs appears enriched for transposable elements suggesting a selective pressure acting on lncRNAs length and composition. Finally, a set of lncRNAs are microsyntenic between teleost and vertebrates, which indicates potential regulatory interactions between lncRNAs and their flanking coding genes. Our work provides a fundamental molecular resource for vertebrate comparative genomics and embryogenesis studies

Crossref

KITopen

University of Birmingham Research Portal

PubMed Central

Sissa Digital Library

Examples of sequence conservation analyses capture a subset of mouse long non-coding RNAs sharing homology with fish conserved genomic elements

Author: A Pauli
AJ Vilella
AN Khachane
AR Quinlan
B Bánfai
C Camacho
C Carrieri
C Trapnell
CJ Brown
D Licastro
DA Hosack
DR Kelley
DW Huang
DW Huang
Ferenc Müller
G Bejerano
GA Calin
H Jia
I Ulitsky
IA Qureshi
J Ponjavic
J Sheik Mohamed
J-W Nam
JL Rinn
JM Silva
JN Hutchinson
JP McCutcheon
KC Pang
KS Pollard
KS Pollard
L Duret
L Hui
L Kong
LA Pennacchio
M Aoyama
M Guttman
M Lin
ME Dinger
ME Dinger
MN Cabili
NR Zearfoss
NT Ingolia
P Carninci
P Flicek
P Flicek
P Flicek
PP Amaral
PP Amaral
R Arrial
RA Chodroff
Remo Sanges
S Haider
S Katayama
S Washietl
SE Seemann
SJ Hubbard
SR Eddy
Swaraj Basu
T Fawcett
T Gesell
T Kino
T Ota
T Sing
T-K Kim
TR Dreszer
TR Mercer
TR Mercer
UA Ørom
Y Okazaki
Y Sakuraba
Y Zhou
Z Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Background: Long non-coding RNAs (lncRNA) are a major class of non-coding RNAs. They are involved in diverse intra-cellular mechanisms like molecular scaffolding, splicing and DNA methylation. Through these mechanisms they are reported to play a role in cellular differentiation and development. They show an enriched expression in the brain where they are implicated in maintaining cellular identity, homeostasis, stress responses and plasticity. Low sequence conservation and lack of functional annotations make it difficult to identify homologs of mammalian lncRNAs in other vertebrates. A computational evaluation of the lncRNAs through systematic conservation analyses of both sequences as well as their genomic architecture is required.Results: Our results show that a subset of mouse candidate lncRNAs could be distinguished from random sequences based on their alignment with zebrafish phastCons elements. Using ROC analyses we were able to define a measure to select significantly conserved lncRNAs. Indeed, starting from ~2,800 mouse lncRNAs we could predict that between 4 and 11% present conserved sequence fragments in fish genomes. Gene ontology (GO) enrichment analyses of protein coding genes, proximal to the region of conservation, in both organisms highlighted similar GO classes like regulation of transcription and central nervous system development. The proximal coding genes in both the species show enrichment of their expression in brain. In summary, we show that interesting genomic regions in zebrafish could be marked based on their sequence homology to a mouse lncRNA, overlap with ESTs and proximity to genes involved in nervous system development.Conclusions: Conservation at the sequence level can identify a subset of putative lncRNA orthologs. The similar protein-coding neighborhood and transcriptional information about the conserved candidates provide support to the hypothesis that they share functional homology. The pipeline herein presented represents a proof of principle showing that a portion between 4 and 11% of lncRNAs retains region of conservation between mammals and fishes. We believe this study will result useful as a reference to analyze the conservation of lncRNAs in newly sequenced genomes and transcriptomes. \uc2\ua9 2013 Basu et al.; licensee BioMed Central Ltd

Crossref

Springer - Publisher Connector

University of Birmingham Research Portal

PubMed Central

Sissa Digital Library

FIDEL—a retrovirus-like retrotransposon and its distinct evolutionary histories in the A- and B-genome components of cultivated peanut

Author: A Brandes
A Kumar
AP Fávero
B SanMiguel
B Yüksel
BD Schrire
C Cheng
C Feschotte
C Snider
C Vitte
Christopher Town
CM Vicient
D Armisén
D Bertioli
D Fonceka
D Grattapaglia
DA Wright
DA Wright
David Bertioli
E Fukai
EM Temsch
EM Temsch
F Chavanne
F Sabot
F Sabot
Fernando Campos-Fonseca
G Kochert
G Seijo
Guillermo Seijo
H Ohtsubo
HM Laten
J Greilhuber
J Maluszynska
JD Thompson
JG Seijo
JL Bennetzen
JR Wortman
JS Hawkins
JS Heslop-Harrison
JX Ma
JY Lin
K Alix
K Kashkush
K Shirasu
K Tamura
KM Devos
KP Singh
LH Madsen
MD Bennett
MD Bennett
MD Burow
P Fransz
P Neumann
P Neumann
P Neumann
P SanMiguel
Patricia Guimarães
PM Guimarães
PS Schnable
R Kalendar
R Staden
RO Hammons
Roberto Arrial
RW Michelmore
S Nielen
S Tabata
SF Altschul
SN Raina
Soraya Leal-Bertioli
SR Pearce
ST Yano
Stephan Nielen
T Pélissier
T Schwarzacher
TH Jukes
WL Gerlach
WL Gerlach
X Huang
XP Zhao
XY Zhang
Y Xiong
YL Orlov
ZL Liu
Publication venue: Springer Netherlands
Publication date: 01/01/2010
Field of study

In this paper, we describe a Ty3-gypsy retrotransposon from allotetraploid peanut (Arachis hypogaea) and its putative diploid ancestors Arachis duranensis (A-genome) and Arachis ipaënsis (B-genome). The consensus sequence is 11,223 bp. The element, named FIDEL (Fairly long Inter-Dispersed Euchromatic LTR retrotransposon), is more frequent in the A- than in the B-genome, with copy numbers of about 3,000 (±950, A. duranensis), 820 (±480, A. ipaënsis), and 3,900 (±1,500, A. hypogaea) per haploid genome. Phylogenetic analysis of reverse transcriptase sequences showed distinct evolution of FIDEL in the ancestor species. Fluorescent in situ hybridization revealed disperse distribution in euchromatin and absence from centromeres, telomeric regions, and the nucleolar organizer region. Using paired sequences from bacterial artificial chromosomes, we showed that elements appear less likely to insert near conserved ancestral genes than near the fast evolving disease resistance gene homologs. Within the Ty3-gypsy elements, FIDEL is most closely related with the Athila/Calypso group of retrovirus-like retrotransposons. Putative transmembrane domains were identified, supporting the presence of a vestigial envelope gene. The results emphasize the importance of FIDEL in the evolution and divergence of different Arachis genomes and also may serve as an example of the role of retrotransposons in the evolution of legume genomes in general

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

CONICET Digital

Springer - Publisher Connector

PubMed Central

De Novo assembly and transcriptome analysis of the mediterranean fruit fly ceratitis capitata early embryos

Author: A Crisanti
A Heffer
A Mortazavi
A Pane
AD Diamantidis
AE Wimmer
AF Ha Harris
AR Malacrida
AR Malacrida
Archana Tomar
B Daines
B Ewen-Campen
B Langmead
B Li
B Li
B Vicoso
BA Carvalho
BJ Haas
BM Wiegmann
D Bachtrog
D Blankenberg
D Bopp
D Bopp
D Lagos
D Marchini
DA Peel
E Frise
E Jiménez-Guri
EC Ogaugwu
EC Verhulst
EC Verhulst
F Catteruccia
F Criscione
F Niazi
F Scolari
G Burghardt
G Franz
G Fu
G Gao
G Saccone
G Saccone
Giuseppe Saccone
GK Davis
GM Shen
Hongyu Zhang
J Intra
J Martínez-Barnetche
J Nagaraju
J Zhou
Javaregowda Nagaraju
JD Evans
JJ Walker
JW Erickson
Kallare P. Arunkumar
KE Hokanson
L Alphey
L Kong
L Vannini
L Wang
LJ Zwiebel
LM Gomulski
LM Metzker
M Hediger
M Koukidou
M Salvemini
M Salvemini
M Salvemini
MA Handler
Marco Salvemini
MF Schetelig
MF Schetelig
MG Grabherr
ML Spletter
MS Beverley
NI Morrison
NJ Shukla
OS Akbari
P Gabrieli
P Tomancak
P Topalis
PA Papathanos
Pedro L. Oliveira
PK Arunkumar
R Lacroix
R Schmieder
R Weiszmann
Remo Sanges
RJ Carey
RT Arrial
S Lemke
S Zhao
SA Robinson
ST O'Neil
T Dafa'alla
T Gempe
T Kiuchi
TG Loukeris
TW Chen
Valeria Petrella
W Zheng
Weiwei Zheng
WT Cline
X Nirmala
Y Zheng
Z Li
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2014
Field of study

The agricultural pest Ceratitis capitata, also known as the Mediterranean fruit fly or Medfly, belongs to the Tephritidae family, which includes a large number of other damaging pest species. The Medfly has been the first non-drosophilid fly species which has been genetically transformed paving the way for designing geneticbased pest control strategies. Furthermore, it is an experimentally tractable model, in which transient and transgene-mediated RNAi have been successfully used. We applied Illumina sequencing to total RNA preparations of 8-10 hours old embryos of C. capitata, This developmental window corresponds to the blastoderm cellularization stage. In summary, we assembled 42,614 transcripts which cluster in 26,319 unique transcripts of which 11,045 correspond to protein coding genes; we identified several hundreds of long ncRNAs; we found an enrichment of transcripts encoding RNA binding proteins among the highly expressed transcripts, such as CcTRA-2, known to be necessary to establish and, most likely, to maintain female sex of C. capitata. Our study is the first de novo assembly performed for Ceratitis capitata based on Illumina NGS technology during embryogenesis and it adds novel data to the previously published C. capitata EST databases. We expect that it will be useful for a variety of applications such as gene cloning and phylogenetic analyses, as well as to advance genetic research and biotechnological applications in the Medfly and other related Tephritidae

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Sissa Digital Library

FigShare